' BIOTECHNOLOGY •.y^^^ggS^K 

COPY OF PAPERS SYSTEMS- ' >^^M^^feM, 

ORIGINALLY FILED BRANCH :^ff %aB» gggmg& 

RAW SEQUENCE LISTING 'WSflH^^'' 



ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and-Tee^^i^f«|^j^f 
Center^(STIC) detected errors when processing the following computer readable 

form: 

Applimtinn M -""^ f ' O^J^i*?'! 0^> 

Source: 0*1 & ' " ' 



Date Processed by STIC: *° 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. < ^ 

p Tpa %V FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) ]*^^^^mWnamOVr IN YOUR NEXT COMMUNICATION TO THE 

APPI IPANT WITH A NOTICE TO COMPLY or, 

2) TELEroWW^^PLICANT AND FAXING A COPY OF THIS PRINTOUT, Wrril A 

FOR N ?SSSlON L QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308^211 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ^^^ 1 ^- 4216 
PATENTING e-mail help: p a tin21l.elp^uspt0.gov or P" 0 "^ 3 ^^^^^^ 
PATENTIN 3.0 e-mail help: pflfin3heln<ausnto.gov or phone 703.306-4119 (R. WMpVv. 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
WPSTON 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 

Checker Version 3.0 " ^ 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1 .825 effective October 1 1990 (old 
niles) and the revised version (new rules) effective July 1. 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Parker Version 3.0 can be down loaded from th » USPTO website at the following address; 

http://www.uspto.gov/web/officcs/pac/checker' 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUCCESTED CORRECTION . SERIAL NUMBER: _ 

ATTN: NEW RULES CASES: r LEASE DISRECAJU) ENGLISH * ALPHA" HEADERS. WHICH WERE INSERTED BY l 

1 Wrapped Nudcics The luimbcrAcxl at the end of each line "wrapped* down to the next line. Jhis may occur if your fil 

Wrapped Aminos was retrieved in a word processor after creatine at Please adjust your right margin to J; this will 
prevent "wrapping." 



Jnvalid Line Length The rules require that a line not cicced 72 characters in IcngthV this includes white spaces. 



^Misaligned Amino 
Numbering 

Non-ASCII 



^Variable Length. 



Patcntln 2.0 
"bug** 



Skipped Sequences 
"(OLD RULES) 



10 



II 



12 



Use of n's or Xaa'i 

"(NEW RULES) 



Jnvalid <2 1 3> 
Response 



Use of<220> 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is aaved In ASCII left. 

\ i \ 1 

Sequencers) c ontain n's or XaVa representing more than, one residue. Per Sequence Rules, 
each ft or Xaa can only represent a single residue, please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patcntln version 2.0*kas Cfttsed tVe <220>i<223>*ection to be missing from amino acid 

sequencers) . Normally, Patcntln would' automatiedfy^cnerate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
ArttflcUt or Unknown sequence a. 

Sequencers) missing. If intentional, please insert the following lines for each skipped scquenc 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where T is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ'lD.NO where "X" it shown) 
This sequence is intentionally skipped 

Plcue also adjust the "(ii) NUMBER OF SEQUENCES:** response to Include the skipped sequence: 



Skipped Sequences' Sequencers) 
'(NEW RULES) 



_ missing. If Intentional, please insert the following lines for each skipped scque 



<2 10> sequence id number 
<40O> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xai represent 

Pa 1.823 of Sequence Rules, the only valid <2I3> responses arc: Unknown, Artificial Sequence, or 
scientific 'name (Genus/species). <220>-<223> section is required when <2I3> response is Unknow 
is Artificial Sequence 



Sequencers) m 



_ missing the <220> "Feature** and associated numeric identifiers and responses 



Patcntln 2.0 
" "bug" 



Use of <220> to <223> is MANDATORY if <2I3> "Organism** response is "Artificial Sequence" or 

"Unknown/* Please explain source of genetic material in <220> I0 : <223> section. 

(See "Federal Reg facr." 06/01/1998. Vol. 63, No. 104. pp. 29631-32) (Sec. 1.823 of Sequence Rule 

Please do not use "Copy to Disk** function of Patcntln version 2.0. This causes a corrupted file, 
resulting in massing mandatory numeric identifiers and responses (a* indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



- Misuse of n 



* h can only be used to represent a single nucleotide in a nucleic acid sequence. N ii not used to reprea 
any value not specifically a nucleotide. 
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OIPE 



RAW SEQUENCE LISTING DATE: 10/17/2001 

PATENT APPLICATION: US/09/915,706 TIME: 15:54:53 

Input Set : A:\5112.app 

Output Set: N:\CRF3\10172001\I915706.raw 

3 <110> APPLICANT: NELSON, DAVID R. 

5 <120> TITLE OF INVENTION: A LIVE, AVIRULENT STRAIN OF V. ANGU I LLARUM THAT 

6 PROTECTS FISH AGAINST INFECTION BY VIRULENT V. 

7 ANGU I LLARUM 

9 <130> FILE REFERENCE: 5112 
C--> 11 <140> CURRENT APPLICATION NUMBER: US/09/915,706 QTT 

C--> 12 <141> CURRENT FILING DATE: 2001-07-26 n c v _ rj om p|y 

14 <160> NUMBER OF SEQ ID NOS : 4 U093 \* j j ^ Hy , 
16 <170> SOFTWARE: Patentln Ver. 2.1 CorrBCtSJ 0 NefeUt,u 



ERRORED SEQUENCES 



18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 3609 

20 <212> TYPE: DNA 

21 <213> ORGANISM: V. Anguillarum 

23 <220> FEATURE: 

24 <223> OTHER INFORMATION: "n" bases may be a, t, c, g, other or unknown 

26 <400> SEQUENCE: 1 

27 gtcgacttat tgcattgatg gcgtacatgg tagtgccatc cttcgtttgc taacaagcgt 60 

28 tgtataaaag . cttggtcggt ttcatcaagt tgaacacaat actcatgatt tttcccactt 120 

29 ccggaaaggg aaaagtgaaa atagcttttg agatcagcct gttctagcag cttttcaatg 180 

30 atctttttcg tcgttacgtt ttgaaaaatc tgacgactgc gtttgtattg caacaagcta 240 

31 agtggatcca atatctctat ttgataataa aactgctgct tgtctttgct atatcctgtg 300 

32 aattgcagag tgctacatat acctgaaaaa aaacgctttc cagaatctaa ttcgtaagac 360 

33 acacaaacag ctttacctag gtttttggta tcgatctcca tgtttgccgc gatggaaacg 420 

34 gaaaactgac acccgccgga tacgctttcc tctccgatta attgcgtgac aatataactt 4 80 

35 ttgctatctg aaagcttaat ggtgagggag cgggtttggt gctttaattc gttactgctc 540 

36 atattcaatt aattcactat taaataaaca gttctaaaag gctgtttatt ggatgaatat 600 

37 tcgaaattat cacataataa ttgatgctat tattacttgc tgtattggta tcaactttca 660 

38 tgctctatac atgtaatata tttcgagtta gaccttaatt caaggtaatt tgtctattta 720 

39 attattatct gaataatatg taatcgattg ctttgtggtt atttttatgt ttgtttcatt 7 80 

40 tttaatgacg gtgagcttgt gcattcatat tttttatgat gacaacatct ttgatgaagt 840 

41 atttaagata ttgttaatgc atgaggggtt tgcgtgtatt ttttatatta aatcataata 900 

42 aaatcaacaa tatatgttat tttgtgtctt tttatagtgt tcttttaaag aggtaggatg 960 

43 acctaaaggt cgcctagaaa tatggcgtaa attgccattg ctataattca cctcaaagat 1020 

44 acactattgg caaattgaca aatatgtcac ttcgtatgaa acaatattag tagatgttgt 1080 

45 ttttgctgca aaaataaaaa tttttctggt tgaaataact caaggcctct agcgttttcc 114 0 

46 tttatcttaa aatacaggaa atagcgattg aagttaattg acacttaagc aasdramgaa 1200 

47 tagtcaacct aacagagcag gaacctatgc ctttgtcaaa gcatcaaatt gagcaacttt 1260 

48 ctaaacctct gagtgatgat tcgatctgtg gcgtttatct taaactggaa aaaagtgctt 1320 

49 ttcgcccatt acgtaatgaa tttaatgtcg cgcaaactgc gctgcgtaag ctaagtcaaa 1380 

50 accctagtgc tgacgagaga gatgcgttac aagaggcatg tctaaataag tggaagattc 1440 

51 tctctgacag tttgtacgaa cagttttcaa aaacaaccag agatatcgag ctcatctcat 1500 

52 ggtttgttgc tgctcaattc cttctcgata ccacattaga aagtgctgcg aatagccttg 1560 

53 agtggttagc ggatttaagt gagaagcact gggatcacct caaccctgta ctaccagttg 1620 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/915,706 



DATE: 10/17/2001 
TIME: 15:54:53 



Input Set : A:\5112.app 

Output Set: N:\CRF3\10172001\I915706.raw 



54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 

E--> 70 

71 
72 
73 
74 
75 
76 
77 
78 

E--> 79 

80 
81 
82 
83 
84 
85 

W--> 86 

87 



aaacgctcaa 
aagcattttt 
tgcaactgcc 
gcgaaatcag 
ttcaattcaa 
tggtgagcac 
cactgcttac 
aagcggaggc 
tgccaagcca 
agaccgtaag 
tgaaccgaga 
aaccgcatag 
tacctgagtt 
ttaatgccgc 
cggtgggcat 
gtgttgaaga 
aaccacaatc 
ggaaaaatca 
actatcggtc 
tactcttggg 
tcaggcatgg 
gacctactgt 
actaagcctt 
gcacgtttag 
tctctttctt 
caaagcggtg 
attctttcat 
tttaggacac 
gcatcaccta 
ttggcgtcat 
agcgagagtt 
cgcgtctttc 
acttgagcct 
cgcttaaag 



atctgatgat 
ccaactagtc 
cttagtcggg 
ccaactgaaa 
gatggaaaac 
taagtgtcat 
ccgtgttgaa 
caagacagta 
tatggataca 
ccaacactta 
cttagctttc 
cccaatttca 
gctgcgagaa 
cggattgaat 
tgaaagcccc 
gcatgtat£t 
atccgcta:^ 
tggcaagtat 
agctagaaac 
gtggcgctcg 
ttggcgtaag 
cttatttatt 
ctaacgatgg 
tttcttacaa 
acacttctat 
gtgttgtgac 
tagacatgcc 
cgttatggca 
tgacgttgaa 
tggcgacttt 
cacgggtatc 
gtacaaggtt 
ccgttcgatg 



gataagggca 

ggcgatagcg 

gaagtgacgt 

tctatgctta 

gccaaacgtt 

tctctaggca 

aacgctttgg 

gagcaagagg 

aaacatatag 

cacgcaggaa 

catttgttga 

tttttgttag 

atgatgtcgg 

catctcgatc 

caaacacctc 

cagacttccc 

drbcgtcggc 

ttacatgcgt 

ggctgaaggt 

taacgttgct 

cgaagttagc 

caacccaggt 

tcaaggtgca 

cgtgagcggg 

ttctcagaag 

ttacgaccta 

acgttaattg 

ttgaactcac 

acgaatggcg 

tcaggacaca 

gataaagaca 

gataacaagc 

aaagatttcc 



aagaaagaga 

aggaaagctc 

tttttgactt 

cgaccacggt 

gtgtcaccca 

gtcaaagtac 

ttcatctaag 

ttgccgaaag 

agcgaatacc 

acctctctga 

gagaagtctc 

aaaaagcgat 

aacaaaacgg 

aggttttgct 

aagcgaagcc 

ctgtagatac 

tctgagttgg 

gtaagcggtc 

aaaaatgacg 

atggacatcg 

gtaactaaag 

aaagacggta 

gacgtttact 

actgacggat 

catcactatg 

ccgaccggga 

gcatgtctat 

aacataagcg 

ccgtaaagac 

aaccagaatc 

acttcgatac 

ttgctaatga 

acccagagaa 



gcaagcagat 

gattctctat 

tcaaagtgca 

ggcgcaagag 

attagatcgt 

caacttcgga 

tggaattaag 

ttcagtttct 

gatggcatca 

actgggtaat 

tgattatttt 

tcgatgggga 

tgacgctctt 

gccggaggtg 

ttccgtttcg 

ccaatctaag 

taattgtgtt 

ttcaagttga 

gttggtttgc 

gtaacggcac 

aagtcgatgg 

aaactgttga 

tccaagttaa 

ctcaaccgta 

agaaagaagg 

aaa^gactt 

ttcatgaata 

cgttagtaag 

gaaagagctg 

agaaaaagtt 

agtgatgggg 

tgatagccag 

cttagttgati 



gcgaaagtta 
gcgccggtgc 
gagagaaaag 
cgtttcgcaa 
ttgtcagcgt 
tttgcgaagt 
ttagcaccga 
gaaggggagc 
gagcaggctc 
ttaaacaata 
cgccagagcg 
tatttatcct 
agtacgattt 
agtactccaa 
gatccgcgaa 
caagatcaaa 
taaaaaataa 
gggcgcagcg 
aatcaactct 
caatgcggat 
tgcttctgaa 
ggttgcattt 
gctagaaaaa 
cgagagccta 
tggtgaacta 
ctggtaagta 
tctcsdrcat 
aaccgtgtca 
ccgtttgttg 
gatttagaag 
caaattcacc 
tttgaagtga 
aattgagc 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3609 



i 



A 



Tfce type of errors shown exist throughout \ ^\ ^ 
the Sequence Listing. Please check subsequent 
sequences for similar errors. <j^ { Q; t j\y 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/915,706 



DATE: 10/17/2001 
TIME: 15:54:54 



Input Set : A:\5112.app 

Output Set: N:\CRF3\10172001\I915706.raw 



L:ll M:270 C: Current Application Number differs, Replaced Application Number 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:70 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:1 
L:79 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 1 
L:86 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID# : 1 
L:86 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 1 
L:86 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 

L:253 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID# : 4 
L:253 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 4 
L:253 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 
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<400> 4 

Met Ala Leu Asn Ser Gin His Lys Arg Val Ser Lys Asn Arg Val Ser 
15 10 15 

lie Thr Tyr Asp Val Glu Thr Asn Gly Ala Val Lys Thr Lys Glu Leu 
20 25 30 

Pro Phe Val Val Gly Val lie Gly Asp Phe Ser Gly His Lys Pro Glu 
35 40 45 

Ser Glu Lys. Val Asp Leu Glu Glu Arg Glu Phe Thr Gly lie Asp Lys 
50 55 60 

Asp Asn Phe Asp Thr Val Met Gly Gin lie His Pro Arg Leu Ser Tyr 
65 70 75 80 

Lys Val Asp Asn Lys Leu Ala Asn Asp Asp Ser Gin Phe Glu Val Asn 
85 90 95 

Leu Ser Leu Arg Ser Met Lys Asp Phe His Pro Glu Asn Leu Val Asp 
100 105 110 



Xaa lie Glu Pro Leu 
115 



